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DETAILED ACTION 

Applicant's Amendment filed 09/29/08 has been entered and carefully 
considered. Claims 1-16, 18-26, 28-55, 57-65, 67-91, 93-101 and 103-111 have 
been amended. However, the limitations of the amended claims have not been 
found to be patentable over prior art of record, therefore, claims 1-111 are 
rejected under the same ground of rejection as set forth below. 

Claim Rejections - 35 USC § 103 

The following is a quotation of 35 U.S.C. 103(a) which forms the basis for 
all obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 102 of this title, if the differences between the subject matter sought to be patented 
and the prior art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 

Ciaims 1-2, 4-5, 13-15, 17, 37, 40-41,43-44, 52-54, 56, 76-77, 79-80, 88- 

90 and 92 a t s < cte - under 35 U.S.C. 103(a) as being unpatentable over Jain 

et al. [US. 6,567,980]. 

As to claims 1, 37, 40 and 76, Jain et al. disclose a computer 
implemented method and corresponding apparatus for displaying multimedia 
information comprising the steps/means for displaying a graphical user interface 
(GUI) on the display (column 4, lines 20-25); 

displaying, in a first area of the GUI, a first visual representation of the 
multimedia information stored in the multimedia document, the first visual 
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representation including a first representation of information of a first type 
stored in the multimedia document including a first representation of information 
of a first type stored in the multimedia document (figure 2, 172) and a first 
representation of information of a second type stored in the multimedia 
document (figure 2, the frame 172 contains both the first type: image and the 
second type: caption text), displaying, in the first area of the GUI, a first lens 
positionable over a plurality of portions of the first visual representation 
displayed within the first area of the GUI (figure 2, the first lens covers the frame 
172); the first lens covering a first portion of the first visual representation within 
the first area; and 

displaying, in a second area of the GUI (figure 2, area 176), a second 
visual representation of the multimedia information stored in the multimedia 
document based on the first lens covering the first portion of the first visual 
representation within the fist area (figure 2, one of the frame of area 176), the 
second visual representation including a second representation of the 
information of the first type stored in the multimedia document and a second 
representation of the information of the second type stored in the multimedia 
document. 

a representation of multimedia information displayed in the first portion 
of the first the representation of multimedia information displayed in the second 
area comprising a portion of the representation of information of the first type 



Application/Control Number: 1 0/081 ,1 29 Page 4 

Art Unit: 2179 

covered by the first lens and a portion of the representation of information of the 
second type covered by the first lens (figure 2, in the Capture process of the 
video in area 172, the frame 172 is captured and extracted from the process, 
then it is displayed in the second display 176. So, one of the multiple keyframes 
in the second display 176 is the extracted keyframe of the first display 172, see 
column 3, line 50 through column 4, line 40); Jain et al. fails to clearly teach 
displaying a first lens moveable in response to user input over representations 
of multimedia information displayed in the GUI. However, Jain cited "When 
video capture is initiated by the user, the video encoding process starts at a 
begin step 762 and moves to step 764 wherein the Video Cataloger 110 
(FIG.1 )." (figure 1 1 , lines 26-30). The user input is over representations of 
multimedia information displayed in the GUI by initiating the video capture by 
the user. The user puts a first lens covering the desired frame 172 by capturing 
and selecting the desired video clip frame 172. If the user wants to capture a 
next video clip (like frame 186), the user puts the first lens on the frame 186 by 
selecting and then displaying this frame in the panel 172. The video keeps 
moving and moving one video clip frame to another video clip frame. If the user 
wants a specific video clip frame, the user would put the first lens over the 
specific video clip frame by selecting and capturing the specific video clip frame. 
The first lens is moveable from one video clip frame to another video clip frame 
depending on user's selection of which video clip frame. Therefore, Jain et al. 
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suggest displaying the first lens moveable in response to user input over the 
video. 

It would have been obvious to one of skill in the art, at the time the 
invention was made, to combine a well known implementation of the suggestion 
to Jain. Motivation of the combination is for a flexibility of choosing the video clip 
frame. 

As to claims 2, 41 and 77, Jain et al. also disclose displaying a first 
thumbnail image in the first area of the GUI (figure 2, 172), the first thumbnail 
image comprising the first representation of information of the first type; and 
displaying a second thumbnail image in the first area of the GUI, the second 
thumbnail image comprising the first representation of the information of the 
second type (figure 2, column 4, lines 20-65). 

As to claims 4, 43 and 79, Jain et al. show determining a first time and a 
second time associating with the first lens; displaying, in the second area of the 
GUI, a representation of the information of the first type occurring between the 
first time and the second time associated With the first lens as the second 
representation of the information of the first type; and displaying, in the second 
area of the GUI, a representation of information of the second type occurring 
between the first time and the second time associated with the first lens as the 
second representation of the information of the second type (figure 2, 178, 
column 6, line 40-60 and column 7, lines 15 through column 8, line 60). 
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As to claims 5, 44 and 80, Jain et al. also show receiving user input moving 
the first lens to cover a second portion of the first area; and responsive to the 
user input, automatically changing second visual representation displayed in the 
second area of the GUI such that the second visual representation of the 
multimedia information stored in the multimedia document displayed in the 
second area of the GUI corresponds to the second portion of the first visual 
representation of the multimedia information stored in the multimedia document 
covered by the first lens (figure 2, receiving the user input to select another 
keyframe of the video in the first display 172 and then display in the second 
display 176). 

As to claims 13, 52 and 88, Jain et al. also teach the information of the first 
type corresponding to video information (column 2, lines 6-24); and the 
representation of the information of the first type comprising one or more video 
keyframes extracted from the video information (column 2, lines 17-22 and 
column 4, lines 20-40). 

As to claims 14, 53 and 89, Jain et al. provide the information of the second 
type corresponding to audio information (column 2, lines 18-20); and the 
representation of information of the second type comprises text information 
obtained from transcribing the audio information (column 2, lines 15-25). 
As to claims 15, 54 and 90, Jain et al. also provide the information of the 
second type corresponding to closed-caption (CC) text information; and the 
representation of information of the second type comprises text (column 4, lines 
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20-50). 

As to claims 17, 56 and 92, Jain et al. also provide receiving input indicating 
selection of a portion of the multimedia information occurring between a first 
time and a second time; and performing a first operation on the portion of the 
multimedia information occurring between a first time and a second time 
(column 6, lines 40-60 and column 12, lines 30-50). 

Claims 3, 6-7, 9-12, 18-36, 38-39, 42, 45-46, 48-51, 57-75, 78, 81-82, 84-87 
and 93-1 1 1 rejected under 35 U.S.C. 103(a) as being unpatentable over Jain et 
al. [US. 6,567,980]. 

As to claims 3, 42 and 78, Jain et al. disclose displaying the, in a first sub- 
area of the second area of the GUI, the second representation of the 
information of the first type as a portion of the first representation of the 
information of the first type covered by the first lens; and displaying, in a second 
sub-are of the second area of the GUI, the second representation of the of the 
first type as a portion of the first representation of the information of the second 
type covered by the first lens (figure 2, 172 (first area). 

Jain et al. fail to clearly teach a first panel and a second panel. However, it 
was well known in the art that the first and second panels are displayed in the 
GUI area (see figure 2, first area 172, the first portion contains image, the 
second portion contains text). It would have been obvious to one of ordinary 
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skill in the art at the time of the invention to display the first and second 
information types in the first and second panels in the second GUI area in order 
to provide a good and convenient GUI. 

As to claims 7, 46 and 82, Jain et al. fail to clearly teach the third area of the 
GUI. However, it was well known in the art to display the third area in the GUI. It 
would have been obvious to one of ordinary skill in the art at the time of the 
invention to display the third area of the GUI in order to provide a good and 
convenient GUI. 

As to claims 6, 45 and 81 , Jain et al. fail to clearly teach the third area of 
the GUI. However, it was well known in the art to display the third area in the 
GUI. It would have been obvious to one of ordinary skill in the art at the time of 
the invention to display the third area of the GUI in order to provide a good and 
convenient GUI. 

Jain et al. fails to clearly teach displaying a first lens moveable in response 
to user input over representations of multimedia information displayed in the 
GUI. However, Jain cited "When video capture is initiated by the user, the video 
encoding process starts at a begin step 762 and moves to step 764 wherein the 
Video Cataloger 1 1 0 (FIG.1 )." (figure 1 1 , lines 26-30). The user input is over 
representations of multimedia information displayed in the GUI by initiating the 
video capture by the user. The user puts a first lens covering the desired frame 
172 by capturing and selecting the desired video clip frame 172. If the user 
wants to capture a next video clip (like frame 186), the user puts the first lens 
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on the frame 186 by selecting and then displaying this frame in the panel 172. 
The video keeps moving and moving one video clip frame to another video clip 
frame. If the user wants a specific video clip frame, the user would put the first 
lens over the specific video clip frame by selecting and capturing the specific 
video clip frame. The first lens is moveable from one video clip frame to another 
video clip frame depending on user's selection of which video clip frame. 
Therefore, Jain et al. suggest displaying the first lens moveable in response to 
user input over the video. It would have been obvious to one of skill in the art, at 
the time the invention was made, to combine a well known implementation of 
the suggestion to Jain. Motivation of the combination is for a flexibility of 
choosing the video clip frame. 

As to claims 9, 22, 48, 61, 84 and 97, Jain et al. disclose receiving a user input 
moving the first lens over the second visual representation displayed within the 
second area to cover a second visual representation within the second of the 
first area and responsive the user input, automatically changing the information 
displayed in the second area of the GUI (figure 2, the user can selects another 
keyframe in the first display 172 and then display it in the second display 176). 
But, Jain et al. fail to clearly teach a third area in order to automatically change 
the information displayed in the third area of the GUI. However, it was well 
known in the art to display the third area in the GUI. It would have been obvious 
to one of ordinary skill in the art at the time of the invention to display the third 
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area of the GUI in order to provide a good and convenient GUI. 
As to claims 10, 49 and 85, Jain et al. disclose receiving a user input moving 
the first lens over the first visual representation displayed within the first area to 
cover a second portion of the first visual representation within area and 
responsive to the user input, automatically changing the second visual 
representation displayed in the second area of the GUI corresponds to the 
representation of multimedia information stored in the multimedia document 
covered by the first lens (figure 2, the first lens cover first and second portions 
of the keyframe in the first display 172). Jain et al. fail to clearly teach the 
information displaying in the third area of the GUI. However, it was well known 
in the art to display the third area in the GUI. It would have been obvious to one 
of ordinary skill in the art at the time of the invention to display the third area of 
the GUI in order to provide a good and convenient GUI. 
As to claim 11, Jain et al. teach displaying a sub-lens covering a portion of the 
first visual representation displayed within the area of the GUI corresponding to 
the first portion of the second visual representation within the second area of 
the GUI covered by the second lens (figure 2, column 6, 42-51 , the sub-lens is 
used to select any other keyframe in the first display 172). 

As to claim 12, Jain et al. also teach receiving a user input moving the 
second lens over the second visual representation displayed within the second 
area to cover a second portion of the second visual representation within the 
area; and responsive to the user input, automatically changing a position of the 
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sub-lens to cover a portion of the first visual representation displayed within the 
area of the GUI corresponding to the second visual representation within the 
second portion of the second area covered by the second lens (figure 2, column 
6, 42-51, the sub-lens is used to select any other keyframe in the first display 
172). 

As to claims 18, 38, 57 and 93, Jain et al. disclose displaying a graphical user 
interface, displaying, in a first area of the GUI, a representation of the 
multimedia information stored by the multimedia document occurring between a 
start time and an end time associated with the multimedia document (figure 2, 
178 and time line 180). Each of the video images occurs between a start time to 
an end time (column 6, lines 1-25). But, Jain et al. fail to clearly teach the step 
of displaying a first lens emphasized a portion of the first area of the GUI, the 
portion of the first area emphasized by the first lens comprising a representation 
of multimedia information occurring between a first time (t1) and a second time 
(t2). 

However, it was well known in the art to display a first lens emphasizing a 
portion of the first area of the GUI, the portion of the first area emphasizing by 
the first lens comprising a representation of multimedia information occurring 
between a first time (t1 ) and a second time (t2). 

It would have been obvious to one of ordinary skill in the art at the time of the 
invention to have the portion of the first area emphasized by the first lens 
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comprising a representation of multimedia information occurring between a first 
time (t1) and a second time (t2) in order to provide a good and convenient GUI. 

Jain et al. fails to clearly teach displaying a first lens moveable in response 
to user input over representations of multimedia information displayed in the 
GUI. However, Jain cited "When video capture is initiated by the user, the video 
encoding process starts at a begin step 762 and moves to step 764 wherein the 
Video Cataloger 1 1 0 (FIG.1 )." (figure 1 1 , lines 26-30). The user input is over 
representations of multimedia information displayed in the GUI by initiating the 
video capture by the user. The user puts a first lens covering the desired frame 
172 by capturing and selecting the desired video clip frame 172. If the user 
wants to capture a next video clip (like frame 186), the user puts the first lens 
on the frame 186 by selecting and then displaying this frame in the panel 172. 
The video keeps moving and moving one video clip frame to another video clip 
frame. If the user wants a specific video clip frame, the user would put the first 
lens over the specific video clip frame by selecting and capturing the specific 
video clip frame. The first lens is moveable from one video clip frame to another 
video clip frame depending on user's selection of which video clip frame. 
Therefore, Jain et al. suggest displaying the first lens moveable in response to 
user input over the video. 

It would have been obvious to one of skill in the art, at the time the 
invention was made, to combine a well known implementation of the suggestion 
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to Jain. Motivation of the combination is for a flexibility of chocs o ?he video dip 
frame. 

As to claims 19, 58 and 94, In light of rejection of the third area, Jain et al. 
show displaying a second lens emphasizing a portion of the second area of the 
GUI, the portion of the second area emphasized by the second lens comprising 
a representation of multimedia information stored in the multimedia document 
occurring between a third time (t3) and a fourth time (t4) (each video image has 
different start-end time. So, a first image has its start-end time (t1-t2) and a 
second image has its start-end time (t3-t4). 

Jain et al. fail to clearly teach the step of displaying, in a third area of the GUI, 
the representation of multimedia information stored in the multimedia document 
occurring between t3 and t4, the representation of multimedia information 
displayed in the third area comprising a representation of information of the first 
type occurring between t3 and t4 and a representation of information of the 
second type occurring between t3 and t4. 

However, it was well known in the art to display the third area in the GUI. It 
would have been obvious to one of ordinary skill in the art at the time of the 
invention to display the third area of the GUI in order to provide a good and 
convenient GUI. 

Jain et al. fails to clearly teach displaying a first lens moveable in response 
to user input over representations of multimedia information displayed in the 
GUI. However, Jain cited "When video capture is initiated by the user, the video 
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encoding process starts at a begin step 762 and moves to step 764 wherein the 
Video Cataloger 1 1 0 (FIG.1 )." (figure 1 1 , lines 26-30). The user input is over 
representations of multimedia information displayed in the GUI by initiating the 
video capture by the user. The user puts a first lens covering the desired frame 
172 by capturing and selecting the desired video clip frame 172. If the user 
wants to capture a next video clip (like frame 186), the user puts the first lens 
on the frame 186 by selecting and then displaying this frame in the panel 172. 
The video keeps moving and moving one video clip frame to another video clip 
frame. If the user wants a specific video clip frame, the user would put the first 
lens over the specific video clip frame by selecting and capturing the specific 
video clip frame. The first lens is moveable from one video clip frame to another 
video clip frame depending on user's selection of which video clip frame. 
Therefore, Jain et al. suggest displaying the first lens moveable in response to 
user input over the video. 

It would have been obvious to one of skill in the art, at the time the invention 
was made, to combine a well known implementation of the suggestion to Jain. 
Motivation of the combination is for a flexibility of choosing the video clip frame. 

As to claims 20, 59 and 95, Jain et al. fail to clearly teach a fifth and sixth 
time, however, in light of rejection of t1 , t2, t3 and t4, it was well known in the 
computer art to represent of multimedia information stored in the multimedia 
document occurring between a t5 and t6. It would have been obvious to one of 
ordinary skill in the art at the time of the invention to display the third area of the 
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GUI in order to provide a good and convenient GUI. 

As to claims 21, 23, 60, 62, 96 and 98, Jain et al. fail to clearly teach the step 
of changing the position of the second lens in response to user input such that 
the second lens emphasizes a portion of the second area of the GUI comprising 
a representation of multimedia information stored in the multimedia document 
occurring between a fifth time (t5) and a sixth time (t6). However, in light of 
rejection of the third area and t5 and t6, it would have been obvious to a person 
of ordinary skill in the art at the time of the invention to change the position of 
the second lens in response to user input such that the second lens 
emphasizes a portion of the second area of the GUI comprising a 
representation of multimedia information occurring between a fifth time (t5) and 
a sixth time (t6). The motivation would have been to provide a convenient and 
good interface. 

As to claims 24, 63 and 99, Jain et al. show the information of the first type 
being video information; the information of the second type being audio 
information; the representation of the information of the first: type comprising 
one or more video keyframes extracted from the video information; and the 
representation of information of the second type comprising text information 
obtained from transcribing the audio information (column 2, lines 6-24, column 
2, lines 17-22 and column 4, lines 20-40 and column 2, lines 15-25). 
As to claims 25, 64 and 100, Jain et al. also show the information of the first 
type being video information; the information of the second type being closed- 
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caption (CC) text information; the representation of the information of the first 
type comprising one or more video keyframes extracted from the video 
information; and the representation of the information of the second type 
comprising text information included in the CC text information (column 4, lines 
20-50). 

As to claims 26, 65 and 101, Jain et al. disclose receiving information 
indicating a first topic; and analyzing the multimedia information stored in the 
multimedia document to identify one or more locations in the multimedia 
information that are relevant to the first topic (column 9, line 52 through column 
10, line 21); wherein displaying , in the first area of the GUI, the representation 
of the multimedia information stored by the multimedia document occurring 
between ts and te in the first area of the GUI comprises highlighting the one or 
more locations in the multimedia information displayed in the first area of the 
GUI that are relevant to the first topic (column 8, lines 23-65); and wherein 
displaying the representation of multimedia information stored in the multimedia 
document occurring between t1 and t2 in the second area of the GUI comprises 
highlighting the one or more locations in the multimedia information that occur 
between times t1 and t2 (column 6, lines 28-67). 
As to claims 27, 66 and 102, Jain et al. teach receiving input indicating 
selection of a portion of the multimedia information occurring between a 
selection start time and a selection' end time; and performing a first operation 
on the portion of the multimedia information occurring between the selection 
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start time and the selection end time (column 6, lines 40-60 and column 12, 
lines 30- 50). 

As to claims 28, 39, 67 and 103, Jain et al. disclose displaying a graphical 
user interface; displaying, in a first area of the GUI, a representation of the 
multimedia information stored by the multimedia document occurring between a 
start time and an end time associated with the multimedia document (figure 2, 
178 and time line 180). Each of the video images occurs between a start time to 
an end time (column 6, lines 1-25). Jain et al. also show displaying, in first 
section of a first area of the GUI, a first set of one or more video keyframes 
extracted from the video information occurring between a start time and an end 
time associated with the multimedia document (column 6, lines 42-60); in light 
of rejection of t1 and t2, it would have been obvious to a person of ordinary skill 
in the art at the time of the invention to display a first lens emphasizing a portion 
of the first section of the first area occurring between a first time and a second 
time and a portion of the second section of the first area occurring between t1 
and t2, the emphasized portion of the first section of the first area comprising a 
second set of one or more video keyframes extracted from the video 
information. The motivation would have been to provide a convenient and good 
interface. 

Jain et al. fails to clearly teach displaying a first lens moveable in response 
to user input over representations of multimedia information displayed in the 
GUI. However, Jain cited "When video capture is initiated by the user, the video 
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encoding process starts at a begin step 762 and moves to step 764 wherein the 
Video Cataloger 1 1 0 (FIG.1 )." (figure 1 1 , lines 26-30). The user input is over 
representations of multimedia information displayed in the GUI by initiating the 
video capture by the user. The user puts a first lens covering the desired frame 
172 by capturing and selecting the desired video clip frame 172. If the user 
wants to capture a next video clip (like frame 186), the user puts the first lens 
on the frame 186 by selecting and then displaying this frame in the panel 172. 
The video keeps moving and moving one video clip frame to another video clip 
frame. If the user wants a specific video clip frame, the user would put the first 
lens over the specific video clip frame by selecting and capturing the specific 
video clip frame. The first lens is moveable from one video clip frame to another 
video clip frame depending on user's selection of which video clip frame. 
Therefore, Jain et al. suggest displaying the first lens moveable in response to 
user input over the video. 

It would have been obvious to one of skill in the art, at the time the invention 
was made, to combine a well known implementation of the suggestion to Jain. 
Motivation of the combination is for a flexibility of choosing the video clip frame. 

As to claims 29, 68 and 104, in light of rejection above about the 
displaying a second lens emphasizing a portion of the first section of the second 
area and a portion of the second section of the second area; and the video 
keyframes, it was well known in the computer art to have the step of "the 
emphasized portion of the first section of the second area comprising a third set 
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of one or more video keyframes extracted from the video information occurring 
between a third time (t3) and a fourth time (t4) and the third set of the video 
keyframes being a subset of the second set of one or more video keyframes. It 
would have been obvious to one of ordinary skill in the art at the time of the 
invention to display the emphasized portion of the first section of the second 
area comprising a third set of one or more video keyframes extracted from the 
video information occurring between a third time (t3) and a fourth time (t4) in 
order to provide a good and convenient GUI. 

Jain et al. fails to clearly teach displaying a first lens moveable in response 
to user input over representations of multimedia information displayed in the 
GUI. However, Jain cited "When video capture is initiated by the user, the video 
encoding process starts at a begin step 762 and moves to step 764 wherein the 
Video Cataloger 1 1 0 (FIG.1 )." (figure 1 1 , lines 26-30). The user input is over 
representations of multimedia information displayed in the GUI bv initiating the 
video capture by the user. The user puts a first lens covering the desired frame 
172 by capturing and selecting the desired video clip frame 172. If the user 
wants to capture a next video clip (like frame 186), the user puts the first lens 
on the frame 186 by selecting and then displaying this frame in the panel 172. 
The video keeps moving and moving one video clip frame to another video clip 
frame. If the user wants a specific video clip frame, the user would put the first 
lens over the specific video clip frame by selecting and capturing the specific 
video clip frame. The first lens is moveable from one video clip frame to another 
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video clip frame depending on user's selection of which video clip frame. 
Therefore, Jain et al. suggest displaying the first lens moveable in response to 
user input over the video. 

It would have been obvious to one of skill in the art, at the time the invention 
was made, to combine a well known implementation of the suggestion to Jain. 
Motivation of the combination is for 3 flexibility of choosing fhe v«d*o chp name. 

As to claims 30, 69 and 105, in light of rejection of claim 29, Jain et al. 
also teach outputting video information starting from t3 or from t4 or from a time 
between t3 and t4 in a first section of a third area of the GUI (column 6, lines 1- 
55 and column 13, lines 35-67); displaying a second lens positionable over a 
plurality of portions within the second area of theGUI. displaying text information 
corresponding to the information of the first type occurring between t3 and t4 in 
a second section of the third area of the GUI (column14, lines 26-67). It would 
have been obvious to one of ordinary skill in the art at the time of the invention 
to display the emphasized portion of the first section of the second area 
comprising a third set of one or more video keyframes extracted from the video 
information occurring between a third time (t3) and a fourth time (t4) in order to 
provide a good and convenient GUI. 

As to claims 31, 70 and 106, Jain et al. show the information of the first 
type being audio information (column 2, lines 10-25); and the text information 
corresponding to the information of the first type being obtained from 
transcribing the audio information (column 4, lines 40-67). 
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As to claims 32, 71 and 107, Jain et al also show the information of the first 
type being closed-caption text information, and the text information 
corresponding to the information of the first type being extracted from the CC 
text information (column 4, lines 20-60 and column 8, line 50 through column 9, 
line 20). 

As to claims 33, 72 and 108, Jain et al. fail to clearly teach a first set of one or 
more slides extracting from the slides information occurring between ts and te. 
However, Jain et al. show the images instead. It was well known in the 
multimedia information that images can be displayed instead. It would have 
been obvious to one of ordinary skill in the art at the time of the invention to 
display the images of the GUI in order to provide a good and convenient GUI. 
As to claims 34, 73 and 109, the claim is analyzed previously discuss with 
respect claims 29 and 33. 

As to claims 35, 74 and 110, in light of rejection above, Jain et al. fail to clearly 
teach the step of "a first set of one or more whiteboard images image extract 
from the whiteboard images information occurring between ts and te". However, 
the whiteboard images were well known in the multimedia information 
environment. It would have been obvious to one of ordinary skill in the art at the 
time of the invention to display the whiteboard images of the GUI in order to 
provide a good and convenient GUI. 

As to claims 36, 75 and 111, the claim is analyzed previously discuss with 
respect claims 28 and 35. 
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As to claim 50, Jain et al. teach displaying a sub-lens covering a portion of the 
first area of the GUI corresponding to the first portion of the second area of the 
GUI covered by the second lens (figure 2, column 6, 42-51 , the sub-lens is used 
to select any other keyframe in the first display 172). 

As to claim 51, Jain et al. also teach receiving a user input moving the second 
lens to cover a second portion of the second area; and responsive to the user 
input, automatically changing a position of the sub-lens to cover a portion of the 
first area of the GUI corresponding to the second portion of the second area 
(figure 2, column 6, 42-51 , the sub-lens is used to select any other keyframe in 
the first display 172). 

As to claim 86, Jain et al. teach displaying a sub-lens covering a portion of the 
first area of the GUI corresponding to the first portion of the second area of the 
GUI covered by the second lens (figure 2, column 6, 42-51 , the sub-lens is used 
to select any other keyframe in the first display 172). 

As to claim 87, Jain et al. also teach receiving a user input moving the second 
lens to cover a second portion of the second area; and responsive to the user 
input, automatically changing a position of the sub-lens to cover a portion of the 
first area of the GUI corresponding to the second portion of the second area 
(figure 2, column 6, 42-51 , the sub-lens is used to select any other keyframe in 
the first display 172). 
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Allowable Subject Matter 

Claims 8, 16, 47, 55, 83 and 91 are objected to as being dependent upon 
a rejected base claim, but would be allowable if rewritten in independent form 
including all of the limitations of the base claim and any intervening claims. 

Response to Arguments 

Applicant's arguments with respect to the feature of "displaying a first 
lens moveable in response to user input over representations of multimedia 
information displayed in the GUI." have been considered but are moot in view of 
the new ground(s) of rejection. 

Applicant has argued that Jain fails to teach "displaying, in a second area 
of the GUI, a representation of multimedia information displayed in the first 
portion of the first area, the representation of multimedia information displayed 
in the second area comprising a portion of the representation of information of 
the first type covered by the first lens and a portion of the representation of 
information of the second type covered by the first lens." 

However, the examiner respectfully disagrees with the argument. Jain 
discloses a panel 172 displaying the live video being digitized, with play, 
stop. ..The frame portions are displayed in section 184 such as frames 186, 188, 
1 90, 1 92, 1 94 that are video clips of the live video. A user captures a desired 
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portion (frame portion 172) of the live video including the frame portions of 
video clips. So, the live video includes video clips such as portions 172, 186, 
188, 190, 192, 194. The user puts a first lens covering the desired frame 172 by 
capturing and selecting the desired video clip frame 172. If the user wants to 
capture a next video clip (like frame 186), the user puts a second lens on the 
frame 186 by selecting and then displaying this frame in the panel 172. 
Therefore, Jain discloses displaying the first lens covering a portion of the first 
area. Jain discloses multiple frame portions in the second area 176. Each 
portion includes two types of information: image and text. Text is displayed at 
the bottom of each image in the second area 176. Applicant's attention is 
directed to image 202. The text information is displayed at the bottom of the 
image 202. It is clearly that Jain discloses a second area that presenting 
portions of the first and second types of information covered by the first lens. 

Applicant also argued Jain fails to provide any active correlation between 
the act of displaying frames in the preview window and the act of displaying 
information in the other portions of the GUI. 

However, the Applicant has argued the limitations (any active correlation 
between the act of displaying frames in the preview window and the act of 
displaying information in the other portions of the GUI) which are not recited in 
the claim. During patent examination, the pending claims must be "given >their< 
broadest reasonable interpretation consistent with the specification." > In re 
Hyatt, 211 F.3d 1367, 1372, 54 USPQ2d 1664, 1667 (Fed. Cir. 2000). Although 
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the claims are interpreted in light of the specification, limitations from the 
specification are not read into the claims. See In re Van Geuns, 988 F.2d 1 181 , 
26 USPQ2d 1057 (Fed. Cir. 1993). 

Applicant always has the opportunity to amend the claims during prosecution, 
and broad interpretation by the examiner reduces the possibility that the claim, 
once issued, will be interpreted more broadly than is justified. In re Prater, 415 
F.2d 1393, 1404-05, 162 USPQ 541, 550-51 (CCPA 1969). 
Also, the Applicant does not specify the invention in the claimed language. The 
claimed language itself "displaying, in a second area of the GUI, a 
representation of multimedia information displayed in the first portion of the first 
area, the representation of multimedia information displayed in the second area 
comprising a portion of the representation of information of the first type 
covered by the first lens and a portion of the representation of information of the 
second type covered by the first lens" is still broad. It does not specify the 
present specification. 

Applicant has also argued that Jain fails to teach displaying a first thumbnail 
image in the first area of the GUI and displaying a second thumbnail image in 
the first area of the GUI. 

However, the examiner respectfully disagrees with the above argument. The 
thumbnail (the current frame 172) in the first area (area 172) comprises a first 
thumbnail image (which is the video picture) and a second thumbnail image 
(which is the text information underneath the video picture. The text information 
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is similar to a text in figure 3, 314, 322 of the present invention). So, the first 
and second thumbnail images are displayed in the first area of the GUI (figure 
2, area 172). 

Applicant argued that the first and second thumbnails of first recorded 
information and third and fourth thumbnails of second recorded information 
different from the first recorded information. However, Jain et al. teach 
displaying a second representation of second stored information, the second 
stored information (frames are in area 176) comprising information of a first type 
and information of a second type (one of a frame 176 contains both the first 
type: image and the second type: caption text), the second representation 
comprising a representation of information of the first type included in the 
second stored information and a representation of information of the second 
type included in the second stored information; 

displaying a third thumbnail comprising a representation of information of a first 
type included in a second recorded information different from the first recorded 
information (the panel 172 comprises pluralities of frames, a first frames 
represents a first recorded information, a second frames represents a second 
recorded information, the first frame is different from the second frame in data. 
Thus, the first recorded information is different from the second recorded 
information); and displaying a fourth thumbnail comprising a representation of 
information of a second type included in the second recorded information. It is 
similar to a second representation of second stored information comprising a 
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first type and second type of information. The third and fourth thumbnails 
comprising a representation of information of a first type and a second type is 
one of a frame 176 which contains both the first type: image (read "the third 
thumbnail" and the second type: caption text (read "the fourth thumbnail); 
Applicant has argued that Jain's panel 1 72 of figure 2 does not appear to show 
highlighting. However, the one of the keyframes of the panel 172 is selected to 
extract from the process. The step of selection one of the keyframe makes its 
appearance highlighted and noticeable. A section of the selected keyframe of 
panel 172 including the first representation: image and the second 
representation: text. Once a user selects one of a thumbnail of the live video in 
panel 172, an appearance of the selected thumbnail is changed its color to 
darker to indicate the thumbnail being highlighted. Then, the selected and 
highlighted thumbnail is displayed in the panel 176. 
The claimed language is not clearly defined the invention specification. 
During patent examination, the pending claims must be "given >their< broadest 
reasonable interpretation consistent with the specification." > In re Hyatt, 21 1 
F.3d 1367, 1372, 54 USPQ2d 1664, 1667 (Fed. Cir. 2000). Although the claims 
are interpreted in light of the specification, limitations from the specification are 
not read into the claims. See In re Van Geuns, 988 F.2d 1 181 , 26 USPQ2d 
1057 (Fed. Cir. 1993). 

Applicant always has the opportunity to amend the claims during prosecution, 
and broad interpretation by the examiner reduces the possibility that the claim, 
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once issued, will be interpreted more broadly than is justified. In re Prater, 415 
F.2d 1393, 1404-05, 162 USPQ 541, 550-51 (CCPA 1969). 
Applicant has argued that Jain does not teach or suggest information of a 
second type. However, panel 172 contains two types of information: image and 
text. Image is considered as the information of the first type; Text (below the 
image) is considered as the information of the second type. The claim language 
"information of the second type" is a very broad term. It does not limit to 
"caption text" and thus text below the image can be read as "information of the 
second type". 

Applicant has also argued that Jain does not teach or suggest the second 
recorded information which is different from the first recorded information. 
However, the panel 172 comprises pluralities of frames, a first frame represents 
a first recorded information, a second frame represents a second recorded 
information. The first frame and the second frame are different in data and thus 
the second recorded information is different from the first recorded information. 
Then, the first and second recorded information are displayed separately in 
panel 176. Therefore, panel 176 displays first and second recorded information 
as well. 

Conclusion 

THIS ACTION IS MADE FINAL. Applicant is reminded of the extension 
of time policy as set forth in 37 CFR 1.136(a). 
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A shortened statutory period for reply to this final action is set to expire 
THREE MONTHS from the mailing date of this action. In the event a first reply 
is filed within TWO MONTHS of the mailing date of this final action and the 
advisory action is not mailed until after the end of the THREE-MONTH 
shortened statutory period, then the shortened statutory period will expire on 
the date the advisory action is mailed, and any extension fee pursuant to 37 
CFR 1 .1 36(a) will be calculated from the mailing date of the advisory action. In 
no event, however, will the statutory period for reply expire later than SIX 
MONTHS from the mailing date of this final action. 

Any inquiry concerning this communication or earlier communications 
from the examiner should be directed to Mylinh Tran. The examiner can 
normally be reached on Mon - Thu from 7:00AM to 3:00PM at 571-272-4141 . 

If attempts to reach the examiner by telephone are unsuccessful, the 
examiner's supervisor, Weilun Lo, can be reached at 571-272-4847. 

The fax phone numbers for the organization where this application or 
proceeding is assigned are as follows: 

571-273-8300 
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Information regarding the status of an application may be obtained from 
the Patent Application Information Retrieval (PAIR) system. Status information 
for published applications may be obtained from either Private PAIR or Public 
PAIR. Status information for unpublished applications is available through 
Private PAIR only. For more information about the PAIR system, see 
http://pair-direct.uspto.gov. Should you have questions on access to the Private 
PAIR system, contact the Electronic Business Center (EBC) at 866-217-9197 
(toll-free). 

Mylinh Tran 
Art Unit 2179 



/Weilun Lo/ 

Supervisory Patent Examiner, Art Unit 2179 



